# Pre-training Optimization
**MrT5 Large** (stanfordnlp)
MrT5 is an efficient byte-level language model built on ByT5 that reduces input sequence length by approximately 50% through dynamic token merging.
Large Language Model · Transformers · Multilingual

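A minimal loading sketch, assuming the checkpoint is published as `stanfordnlp/mrt5-large` and exposes its custom token-merging architecture through `trust_remote_code`; both assumptions should be checked against the model card.

```python
# Hedged sketch: loading a byte-level seq2seq checkpoint such as MrT5.
# The repo id and the trust_remote_code requirement are assumptions;
# MrT5's token-deletion/merging gates are custom layers, not core transformers.
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_id = "stanfordnlp/mrt5-large"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id, trust_remote_code=True)

# Byte-level models tokenize UTF-8 bytes, so input sequences are long;
# the merging gates are what shrink them inside the encoder.
inputs = tokenizer("Translate to German: Hello, world!", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
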
**Llama 3 70B Special Tokens Adjusted** (astronomer)
A version of Meta-Llama-3-70B with adjusted special tokens, fixing the fine-tuning issues caused by untrained special-token embeddings in the original release.
Large Language Model · Transformers

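The general technique behind this kind of fix is to re-initialise the untrained special-token embedding rows before fine-tuning. A hedged sketch of that idea follows; the mean-initialisation strategy and the token list are assumptions about this class of fix, not a description of exactly what the published checkpoint does.

```python
# Hedged sketch: re-initialise untrained special-token embeddings so that
# fine-tuning with chat templates does not destabilise. Strategy and token
# list are assumptions, not the checkpoint's documented procedure.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "meta-llama/Meta-Llama-3-70B"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)

special_ids = tokenizer.convert_tokens_to_ids(
    ["<|eot_id|>", "<|start_header_id|>", "<|end_header_id|>"]
)
with torch.no_grad():
    in_emb = model.get_input_embeddings().weight
    out_emb = model.get_output_embeddings().weight
    mean_in, mean_out = in_emb.mean(dim=0), out_emb.mean(dim=0)
    for tid in special_ids:
        in_emb[tid] = mean_in    # replace near-zero, untrained rows
        out_emb[tid] = mean_out  # lm_head rows need the same treatment
```
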
**BERT MLM Medium** (aajrami)
A medium-sized BERT language model pre-trained with the masked language modeling (MLM) objective.
Large Language Model · Transformers

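Since the pre-training objective is standard MLM, the checkpoint can be exercised with a fill-mask pipeline. A minimal sketch, assuming the hub id `aajrami/bert-mlm-medium`:

```python
# Hedged sketch: scoring candidates for a masked position, which mirrors
# the MLM pre-training objective. The repo id is assumed from the listing.
from transformers import pipeline

fill = pipeline("fill-mask", model="aajrami/bert-mlm-medium")
masked = f"The capital of France is {fill.tokenizer.mask_token}."
for pred in fill(masked):
    print(pred["token_str"], round(pred["score"], 3))
```
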
**BERT FC Medium** (aajrami)
A medium-sized BERT language model that uses first-character prediction as its pre-training objective.
Large Language Model · Transformers

**Randeng Pegasus 523M Chinese** (IDEA-CCNL)
A Chinese version of PEGASUS-large specialized for text summarization, trained on the PEGASUS architecture with optimizations for Chinese tokenization.
Text Generation · Transformers · Chinese

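A minimal summarization sketch, assuming the checkpoint is `IDEA-CCNL/Randeng-Pegasus-523M-Chinese` and that its tokenizer loads via the standard classes; because the entry mentions a customised Chinese tokenizer, the model card's own loading instructions should take precedence if this fails.

```python
# Hedged sketch: Chinese abstractive summarization with a PEGASUS checkpoint.
# Repo id and default-tokenizer loading are assumptions; the model card
# describes a customised Chinese tokenizer, so prefer its instructions.
from transformers import AutoTokenizer, PegasusForConditionalGeneration

model_id = "IDEA-CCNL/Randeng-Pegasus-523M-Chinese"  # assumed repo id
tok = AutoTokenizer.from_pretrained(model_id)
model = PegasusForConditionalGeneration.from_pretrained(model_id)

text = "据报道，该公司今日发布了新一代大语言模型，推理速度较上一代提升一倍，并将于下月开放接口。"
inputs = tok(text, return_tensors="pt", truncation=True, max_length=512)
summary_ids = model.generate(**inputs, max_new_tokens=64, num_beams=4)
print(tok.decode(summary_ids[0], skip_special_tokens=True))
```
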
**ReasonBERT TAPAS** (Anonymous)
Built on the tapas-base architecture and further pre-trained on table inputs to strengthen reasoning in question-answering tasks.
Large Language Model · Transformers

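Since the entry describes a pre-trained encoder rather than a fine-tuned QA head, a reasonable use is to encode a table–question pair and fine-tune on top of the representations. A hedged sketch, with the repo id and tokenizer availability both assumed:

```python
# Hedged sketch: encoding a table + question with a TAPAS-style checkpoint.
# The repo id is assumed from the listing; the checkpoint is treated as a
# plain encoder whose outputs feed a downstream QA fine-tune.
import pandas as pd
from transformers import TapasTokenizer, TapasModel

model_id = "Anonymous/ReasonBERT-TAPAS"  # assumed repo id
tok = TapasTokenizer.from_pretrained(model_id)
model = TapasModel.from_pretrained(model_id)

table = pd.DataFrame({"City": ["Paris", "Berlin"], "Population (M)": ["2.1", "3.6"]})
inputs = tok(table=table, queries=["Which city has the larger population?"], return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_size)
```
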